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(54) Tide: METHODS OF SELECTING A RANDOM PEPTIDE THAT BINDS TO A TARGET PROTEIN 



(57) Abstract 

A method of isolating a random peptide-scaffold peptide fusion protein that binds a predetermined target protein is disclosed. First, 
a library of nucleotide sequences encoding random amino acid sequences are generated and cloned into a first expression vector to make a 
library test vector. The nucleotide sequence is expressed within a yeast cell as a test fusion protein containing a transcriptional activation 
domain, a random peptide domain, and a scaffold peptide domain. The DNA sequence encoding a target protein cloned into a second 
expression vector provides a target vector. The expressed target fusion protein contains a DNA-binding domain and a target protein. Test 
fusion protein-target fusion protein interaction is detected according to a two-hybrid protein-protein interaction assay. Also disclosed are 
an HIV- 1 Rev protein binding peptide isolated by the method of the invention and methods of using the Rev interacting protein. 
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METHODS OP SELECTING A RANDOM PEPTIDE 
THAT BINDS TO A TAPP ET PRQTRTJI 

Statement as to Federally Snans^ red Res«>^ r 7 h 
5 This invention was made at least in part with 

funds from the Federal government, and the government 
therefore has rights in the invention. 

BacKqround of the Invention 
The invention relates to a method of selecting a 
io fusion protein which binds to a target protein of 

interest within a host cell. The invention also relates 
to a polypeptide that binds to the human immunodef iciency= 
virus (HIV)-l Rev protein, and methods of inhibiting HIV- 
1 production using the polypeptide. 
15 Protein-protein interactions within a cell are the 

basis for countless physiological control mechanisms. 
Viral control mechanisms are the targets of searches for 
the cellular host proteins with which a viral protein 
interacts as part of its function in the cell. 
20 The "two-hybrid" assay, is a genetic screen 

originally described by Fields et al. (Fields S. et al. 
(1989) Nature, 2±p.:245-246; Fields, S. and Sternglanz, R. 
(1994) Trends In Genetics, Afl: 286-292) to test protein- 
target protein interactions. The "two-hybrid" selection 
2S system utilizes the interaction between two proteins of 
interest to reconstitute an active transcription factor 
from two fusion proteins; one containing a DNA binding 
domain fused to the protein of interest, and the other 
containing a transcriptional activation region fused to a 
30 cDNA library (Chi en C. T. et al. (1991) P.N.A.S. USA, 

&fi: 9578-9582; Dal ton, s. et al. (1992) Cell, 6_fi: 597-612) . 
Test protein-target protein interaction is indicated by 
expression of a marker gene operably linked to a promoter 
activated by the reconstituted transcriptional factor. 
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Though useful, th. two hybrid sysfa cannot assay 
S anH! T 0tei " inta " Cti »"»- According to Fi.ids, 
I'olTL ' (1994 '' ™' proteins cannot 

5 selection by this system. 

deoend ° bligate intracellular parasites and 

depend on protein-protein interactions between viral and 
host cell, proteins to carry out functions vital to their 
life cycle. Blocking or inhibiting these critical 
io interactions is a therapeutic strategy that may have 

profound effects on the viral pathogen and little effect 
on the host cell. Furthermore, as mutation of the viral 
protein may be constrained by the requirement for 
functional interaction with host factors, such a strategy 
15 may be resistent to viral escape mutations. 

Current strategies to inhibit replication of HIV 
are commonly aimed at the virus' reverse transcriptase 
using nucleoside analogues like AZT, ddc and ddl 
However the complex life cycle of this retrovirus offers 
20 unique targets of intervention in the form of small 
regulatory proteins that are now being intensively 
studied, one of these proteins. Rev, is a regulator of 
virion protein expression. 

Rev was first identified in proviral mutants that 
25 were unable to replicate because they did not produce 
detectable levels of Env and Gag proteins 
(Sodroski et al. (1986) Nature, 321:412-417; 
Peinberg et al. (1986) Cell, 4^:807-817). it is now 
clear that Rev induces the synthesis of these structural 
>o proteins by allowing the appearance of their mRNAs in the 
cytoplasm (Malim, M. et al. (1989) Nature, i^fi: 254-257; 
Emerman, N. et al. (i 98 9) cell, 52:ii55-ii65) . since the 
mRNAs coding for Gag, Env, and Pol contain introns with 
sequences to which the Rev protein specifically binds, 
5 these mRNAs are normally translocated to the cytoplasm 
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for translation to their respective proteins during HIV 
infection. 

In the absence of Rev protein function , the mRNAs 
encoding Gag, Env, and Pol remain in the nucleus, are not 
5 expressed, and HIV-l replication is inhibited. As no 
cellular protein is known to perforin a similar function, 
and because the concentration of the Rev protein must 
exceed a threshold for function (Pomerantz, R.J. et al. 

(1992) J. Virology, £&: 1809-1813) , Rev is considered a 
10 prime target for specific antiviral agents (Cohen, J. 

(1993) Science, igfl:1257; Rosen, C.A. (1992) AIDS 
Research and Human Retroviruses, £: 175-181; Duan, L. et 
al. (1994) P.N.A.S. USA, 91x5075-5079) - 

Rev is a small 116 amino acid phosphoprotein that 
is is located predominantly in the nucleus /nucleolus of 
human cells infected with HIV-l (Cullen, B. R. et al. 

(1988) J. Virol., £2:2498-2501; Pelber, B. K. et al. 

(1989) P.N.A.S. USA, SAz 1495-1499) . Extensive 
mutagenesis has revealed that Rev contains at least two 

20 domains that are necessary for function: l) a basic RNA 
binding domain at the N-terminus that binds to a specific 
RNA sequence (RRE, Rev Response Element) present in all 
unspliced HIV mRNAs, and 2) a C-terminal effector domain 
(Malim, H. et al. (1989) Cell, 51:205-214; Olsen, 

25 H.Sv et al. (1990) Genes and Dev., 1:1357-1364) necessary 
for function. The effector domain was defined by 
mutations, e.g., M10, that not only abolish Rev function 
but at the same time create transdominant negative 
mutants, i.e., mutants able to repress wild type Rev 

30 function when added in a molar excess (Malim, M. et al. 
(1989), supra). The M10 mutation does not affect the 
normal multimerization of Rev nor the ability of Rev to 
bind RNA, suggesting that the region of Rev containing 
the M10 mutation interacts with one or more cellular 

35 proteins (Malim, M. et al. (1991) J. Virol., £5:4248- 
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4254) and that the interaction with a putative cellular 
target is essential for Rev function. 



Summary of tft e ^^T^lTTl 
In general, the invention features a method of 
5 selecting a random amino acid sequence that binds to a 
target protein in a host cell, e.g., a yeast cell. The 
invention is based on the unexpected discovery that a 
stably folded polypeptide domain (a "scaffold") fused to 
the random amino acid sequence enhances such selection. 
10 In addition, the invention features a random peptide 
detected by the method of the invention, which is not 
naturally expressed in the cell from which its transcript 
was isolated, and which unexpectedly exhibited specific 
binding to the Rev protein effector domain. 
15 It is an object of the invention to enhance the 

ability to isolate random peptides which bind to a target 
protein of interest by expressing the random peptide 
within a known stable 3-dimensional structure, or 
scaffold, as a fusion protein. This fusion protein is 
20 expressed from a nucleic acid sequence encoding a random 
sequence of amino acids covalently linked to a scaffold 
peptide. The stably folded structure of the scaffold 
enhances and/or enables binding of the random peptide to 
the target protein. Binding of the random peptide- 
25 scaffold peptide fusion to the target protein of interest 
is monitored by a two-hybrid selection method (Fields, S. 
et al. (1989), supra). 

In general, the scaffold is a peptide sequence 
that stably folds into a 3-dimensional structure and 
30 which, when fused to a random amino acid sequence of the 
invention, enhances presentation of the random sequence 
to the target protein. Specific embodiments of the 
scaffold peptide include nucleotide sequences which 
encode and allow translation of an Adh C-terminus (SEQ ID 



WO 96/23899 

PCT/US95/09589 



NO:3);_the Bl domain of G protein (Gronenborn, A.M. 
(1991) Science, 252:657-661); a zinc finger domain (Lee, 
M.S. (1989), Science, 245:635); the B domain of protein A 
from S. aureus (Gouda et al. (1992) Biochemistry il:9665- 
5 9672); the tenth type III module of fibronectin (Baron et 
al. (1991) Biochemistry £1: 2068-2073 ) ; and cyclophilin 
(Ke et al. (1991) PNAS USA fifi: 9483-9487) . The random 
peptide can be linked at either end, or preferably in the 
middle, and most preferably in external loops of the 
10 scaffold, making the random peptide accessible to the 
target protein for binding. 

Thev invention further features a non-natural 
fusion protein, RIP (fiev interacting srotein, SEQ 1D 
NO: 2), isolated by expressing a non-naturally translated 
is peptide fused to a scaffold peptide domain, which fusion 
peptide binds to the HIV-1 Rev protein and inhibits its 
function. 

In addition, the invention features a 
substantially pure DNA sequence (SEQ ID NO: 4), referred 
20 to herein as the "R32 nucleotide sequence, 9 * and the 

deduced amino acid sequence (SEQ ID NO: 5). This sequence 
does not include any polylinker or scaffold sequences. 

The invention also features a Rev protein binding 
protein encoded by a DNA sequence of SEQ ID NO: 4 linked 
25 to a scaffold peptide-encoding sequence. 

The invention also features a vector containing, 
in the order of transcription, a nucleotide sequence 
encoding one member of a two-hybrid binding peptide pair 
(i.e. a transcriptional activation domain (Gal4 or Herpes 
30 virus VP16, for example) ; a nucleotide sequence encoding 
a random peptide sequence to be tested; and a nucleotide 
sequence encoding a scaffold peptide domain. 

In addition, the invention features a second 
vector containing, in the order of transcription, a 
35 nucleotide sequence encoding a DNA binding domain (Gal4 
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or LexA, for example) ; and a nucleotide sequence encoding 
a target protein. It is understood by one of ordinary 
skill in the art that the transcriptional activation 
domain or the complementary DNA binding domain of a two- 
hybrid binding peptide pair can be varied in position 
with its respective fusion protein as necessary to 
improve transcriptional activation. 

One of the vectors of the two-hybrid system 
contains a nuclear localization signal to allow 
translocation of the complex of interacting fusion 
proteins into the nucleus for interaction with the 
transcriptional activation sequences of the marker gene. 
It is also noted that vectors used in selecting a random 
peptide in the context of a scaffold peptide domain can 
encode the random peptide 5' to the scaffold peptide 
domain (in the order of transcription) , or the random 
peptide-encoding sequence can be inserted into the coding 
sequence of the scaffold peptide domain. 

The invention also features an antibody to the Rip 
protein. The antibody may be a monoclonal antibody or a 
polyclonal antibody. For use in assays of the invention, 
the anti-RIP antibody, or the RIP itself, may also be 
labeled for detection such as with a fluorescent moiety 
covalently attached to the antibody, or peptide, without 
interfering with antibody binding to the RIP. 

In addition, the invention features an assay for 
the qualitative or quantitative detection of Rev protein 
in a cellular lysate, by inducing a population of cells 
containing or suspected of containing HIV to proliferate, 
thereby inducing HIV to replicate and produce Rev 
protein. The cells are lysed and the proteins of the 
lysate are contacted with RIP, e.g., immobilized on a 
solid support, to allow binding of Rev. The presence of 
the RIP-Rev complex is indicated by a decrease in the 
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detection signal observed when Rev is present compared to 
a control sample having no Rev protein. 

The invention further features a method for 
screening candidate agents that inhibit HIV replication 
5 by inhibiting Rev production. Human cells containing 

actively replicating HIV are contacted with the candidate 
agent in an appropriate culture medium. As measured, 
e.g., by the RIP/anti-RIP antibody assay, a decrease in 
the cellular concentration of Rev in treated cells 
10 compared to a control without the candidate agent, 
indicates that the agent is potentially useful for 
inhibiting HIV replication in human cells. 

The invention also features a method of inhibiting 
HIV reproduction in lymphocytes of a mammal (preferably a 
is human) by introducing DMA encoding RIP into the 
lymphocytes where the ONA is expressed and RIP is 
produced in amounts sufficient to inhibit HIV 
reproduction. Moreover, the invention features a method 
of inhibiting the production of HIV-l in human cells, 
20 e.g., therapeutically or prophylactically, by 

administering the RIP protein to the cells using gene 
therapy techniques. 

By "random peptide" or "random amino acid 
sequence," is meant any peptide that does, or does not, 
25 naturally exist in a cell. For example, a random peptide 
according to the invention can be generated by the 
translation of a nucleotide sequence that does not encode 
a naturally occurring full-length protein, or by 
synthetic methods. The nucleotide sequence encoding a 
30 random peptide is encoded, for example, by a cDNA 

sequence isolated by reverse transcription of a cellular 
sense or antisense mRNA transcript, or by a synthetic 
oligonucleotide containing amino acid codons linked in 
random order. Examples of a nucleotide sequence encoding 
35 a random peptide include, but are not limited to: 1) 
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cellular antisense mRNA reverse transcribed into cDNA 
then expressed as a polypeptide which binds to a target 
protein of interest, and 2) a polypeptide generated by 
the expression of a nucleic acid sequence encoding a 
randomly generated amino acid sequence. 

A "stably folded peptide" is a peptide that is 
resistant to denaturation, e.g., thermal denaturation, in 
a cell. 

By "reduced HIV-l protein production in cells 
expressing the virus" is meant a decrease in the 
expression of, for example, essential proteins Gag, Env, 
and Pol in RIP-containing cells compared to the normal 
expression in the same cells lacking RIP. 

By "sufficient concentration of HIV-l Rev binding 
is peptide" is meant a concentration of RIP in a cell above 
the threshold necessary to inhibit translocation the Gag, 
Env, and Pol mRNAs from the nucleus into the cytoplasm. 

By "substantially pure random peptide-scaf fold 
peptide fusion protein" is meant that the random peptide- 
scaffold peptide fusion protein provided by the invention 
is at least 60%, by weight, free from the proteins and 
naturally-occurring organic molecules with which it is 
naturally associated. Preferably, the preparation is at 
least 75%, more preferably at least 90%, by weight, 
25 random peptide-scaf fold peptide fusion protein. A 

substantially pure random peptide-scaf fold peptide fusion 
protein may be obtained, for example, by expression of 
the recombinant nucleic acid of the library test vector 
encoding it, or by chemically synthesizing the protein. 
30 Purity can be measured by any appropriate method, e.g., 
column chromatography, polyacrylamide gel 
electrophoresis, or HPLC analysis. 

By "substantially identical amino acid sequence" 
is meant an amino acid sequence that differs from another 
35 sequence only by conservative amino acid substitutions, 



20 
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for example , substitution of one amino acid for another 
of the same class (e.g., valine for glycine or arginine 
for lysine) , or by one or more non-conservative amino 
acid substitutions, deletions, or insertions at positions 
5 of the amino acid sequence that do not destroy the 
biological activity of the random peptide-scaf f old 
peptide fusion protein. 

By "operably linked- means that selected DNA is in 
proximity with transcriptional and translational 

10 regulatory sequences to allow these sequences to regulate 
expression of the selected DNA* 

Unless otherwise defined, all technical and 
scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art 

is to which this invention belongs. Although methods and 
materials similar or equivalent to those described herein 
can be used in the practice or testing of the present 
invention, the preferred methods and materials are 
described below. All publications, patent applications, 

20 patents, and other references mentioned herein are 

incorporated by reference. In addition, the materials, 
methods, and examples are illustrative only and not 
intended to be limiting* 

Other features and advantages of the invention 

25 will be apparent from the detailed description, and from 
the claims. 

Brief Description of the Drawings 
Fig* 1 is a diagram of the results of HIV antigen 
production and Rev binding experiments comparing 
30 characteristics of wild type Rev protein and various 

mutant Rev proteins. Data for HIV-1 gtat and p24 antigen 
production were obtained from Malim, et al* (1991), 
supra* RIP-Rev binding experiments were performed by the 
method of the invention. 
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Pig. 2 is a diagram of R32 clones used to evaluate 
regions necessary for Rev binding activity. 

Pig. 3A is a diagram showing various N-terminal 
deletions of the R32 nucleotide sequence starting at the 
s ZcoRl site. The XhoT. site was filled in with a- 
thionucleotides to achieve unidirectional deletions 
according to standard methods (Henikoff (1984) Gene, 
2£.: 351-359) . 

Pig. 3B is a diagram showing the structure of the 

io initial test vector construct and of active test vector 
constructs after bidirectional exonuclease in deletions. 
The deletions were started from the Xbal site. 

Fig. 4 is a representation of the nucleotide and 
deduced amino acid sequences of the Rev interacting 

is protein, RIP. The RIP coding sequence consists of the 
downstream 175 nucleotides of the originally isolated R32 
sequence (from the human MCM2 homologue in antisense 
orientation) linked to OKA linker sequences from the 
cloning vector cDNA, 15 nucleotides from the yeast ADH 

20 gene, and 108 nucleotides encoding 35 amino acids plus a 
stop codon (SEQ ID NO: 3) from the Adh C-terminus. The 
shaded portions have been found to be dispensable for 
binding to Rev in C-terminal deletion experiments, but 
were included for cloning convenience in the experiments 

25 expressing RIP in mammalian cells. 

Fig. 5 is a graph showing the inhibition of HIV-1 
gag p24 antigen production by increasing amounts of RIP- 
encoding DNA introduced into HeLa cells by cotransfection 
with HIV-1 infectious molecular clone, pNL4-3 (Adachi, 

30 A. et al. (1986) J. Virol. 12:284-291). 

Figs. 6A and 6B are graphs showing the inhibition 
of HIV production in CEM lymphocytes by RIP protein (Fig. 
6A) ; and showing that RIP- induced inhibition is specific 
to HIV-1 Rev since RIP does not affect expression of 

35 another protein, alkaline phosphatase, in the CEM 
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lymphocytes (Fig. 6B) . The results also show that 
inhibition of HIV-l production by RIP is not caused by 
non-specific cell toxicity of RIP since the expression of 
a control gene, alkaline phosphatase, is unaffected. 
5 Fig. 7 is a graph showing the specific and dosage 

dependent inhibition of HIV-1 production in H9 cells by 
RIP. The results show that the inhibitory effect of RIP 
is due to expression of the RIP peptide and is not caused 
by inhibition of translation due to an antisense effect. 

io Fig. 8 is a diagram of a method of assaying for 

the presence of Rev in a cell lysate by contacting the 
cellular and viral proteins of the cell lysate with RIP 
immobilized on a solid support. Detection is 
accomplished by the relative amounts of anti-RIP antibody 

15 that is excluded from binding to the support due to the 
presence of the Rev-RIP complex. 

Detailed Description 
The following examples are set forth so as to 
provide those of ordinary skill in the art with a 

20 complete disclosure and description of how to select, in 
a host cell, a fusion protein containing a random amino 
acid sequence fused to a scaffold protein domain, which 
fusion protein binds a specific target protein. Also 
disclosed is such a fusion protein, RIP, and methods of 

25 using the same* Such disclosures are not intended to 
limit the scope of the invention. 

As originally presented, the two-hybrid system 
exploits the ability of a pair of interacting proteins to 
bring a transcription activation domain into close 

30 proximity with a DNA-binding site that regulates the 
expression of an adjacent reporter rene (Fields, S. and 
Song, O.-K. (1989) Nature, 24&:245-246) . In general, the 
two-hybrid assay is performed in yeast cells and requires 
the construction of hybrid genes to encode 1) a DNA 
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binding domain fused to a protein x- o» 

domain fused to a protein v ^ > *" activation 

tneDNAbinding site, and the interaction of the 
actxvataon domain with the transciptional machinery 
thereby activating transcription of the reporter^ 
in the general assay, to select a protein Y, that hinds 
to a target protein X, Y is encoded by a pool of p^rnods 
10 in which total cONA or genomic ONA is lighted to the 
activation domain (Fields, S. and Sternglanz, „. (1994) 
Trends in Genetics, JJi: 286-292) . 

The following examples provide support for the 
discovery that a scaffold peptide domain fused to a 

" 22t°« PePtlde €nhanCeS bindin * Election of the random 
peptide in a host cell, e.g., a yeast cell. 

Examnl » ? 

EX*PPJ» IT ft Mot-hoi nf T S nl a ^ n n Ran ^ m p ^^. 
20 Proteip 

Selection of a random peptide in the -two-hybrid- 
assay is limited by the ability of the random peptides 
to fold correctly within the cell (Fields, s. and 
Sternglanz, R. (1994) Trends in Genetics, AS: 286-292) . 

T!/ ahd ° m PePtide Sequences «• unlixely to fold into a 
stable compact protein. As described in the examples, 
the unexpected finding that a Rev interacting protein 
required an Adh scaffold sequence (SEQ id NO: 3) for 
binding demonstrates a need for a stably folded scaffold 
30 peptide to be fused to the random amino acid sequence to 
be selected. 

To provide a useful scaffold peptide domain, the 
Adh transcription termination sequence was expressed as a 
read-through sequence which accompanied transcription of 
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the random amino acid sequence to be selected by the 
method. The observed binding requirement for the Adh 
transcription termination sequence indicated that the Adh 
sequence was necessary as a molecular scaffold that 
5 stabilized the entire fusion protein, and thus the random 
peptide. Thus, the 35 amino acids, 

LVKSPIKWGLSTLPEIYEKMEKGQIVRYWDTSK (SEQ ID NO: 3) from 
the Adh transcription termination sequence are required 
to be present in the expressed fusion protein to serve as 
10 the scaffold for the purpose of structural presentation 
of a fused random peptide in a two hybrid selection 
system. 

Another scaffold protein sequence useful in the 
method of the invention is the Bl domain of the 

is Streptococcal protein 6. This protein domain is. a single 
immunoglobulin binding domain, and the folded structure 
has been extensively studied (Gronenborn, A.M. et al. 
(1991) Science, 253 :657-661) . but never used in the two- 
hybrid system. The Bl protein domain forms a stably 

20 folded structure which is not denatured below a 
temperature of 87 °C, exhibits reversible thermal 
denaturation, and maintains its native conformation in 
aqueous solution containing up to approximately 8 M urea. 
DNA encoding a random peptide is preferably 

25 inserted between amino acids 9 and 12 of the Bl domain at 
the site of a loop between two beta sheets (see 
Gronenborn, A.M. (1991), supra, for Bl structural 
analysis) , thereby providing optimal presentation of the 
random peptide for binding to the target protein. The 

30 transcriptional activation domain is fused to the N- 
terminus of the Bl domain* 

Other scaffold peptides described above can be 
used in a similar fashion. 

In general, libraries of test vectors are 

35 constructed in which each vector contains, in the 
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LTaT^ ::izzr encoding a sc — — 

s Bi domain oAt^:™i protein ^ ' ( Gronenborn ' aV^ 
(».!) science, ^: 657 - 661) . A target 
constructed which contains, in the order of 
transcription, the Gal4 DNA binding domain linked to the 
coding sequence of a target protein of interest, e g 
10 the entire Rev coding sequence. 9 ' ' 

Gal^DNA^i n elatiVe POSiti ° n ° f *** targSt do »* in •»* 
bindxng domain can be reversed and still 

generate a functional target fusion protein. Also, as 

is Ztl iHT' ^ bindin9 d0nalnS SUCh - -n be 

I*** in aS8aV « In addition, the yeast expression 

vectors of the invention contain appropriate 
transcriptional and translational regulatory sequences 
and nuclear localization signal coding sequences for 
expression and nuclear localization of the test and 

20 target fusion proteins. 

The interaction of the test random amino acid 
sequence and the target domain in a cell provides a 
functional GAM transcriptional activator resulting in 
the expression of a marker gene (e.g. lacZ) under the 

Sterl^" 0 " 1 C ° ntr01 ° f * ^ Pr0n0t ~- A P~^ve 
xnteraction is detected as blue yeast colonies on yeast 

culture solid medium containing X-Gal. 

As the length of the random amino acid sequence 

3 o inr^ 8 ^' nUmber ° £ * ossi "* random fusion proteins 

30 increases exponentially. For example, a random 20 amino 
acid sequence can have any one of 20 naturally occurring 
amino acids at each position, or 20*0 different sequences. 
The Rev effector domain as an example of a target domain 
is likely to bind multiple sequences, increasing the 
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probability of detecting a binding reaction in the 
millions of independent clones screened in each assay. 

Example 2; Cloning of a Prot ein That Binds to 

Effggtor Domain Of the HIV-l Rev Protein 
5 The search for a Rev effector protein was 

performed by initially constructing a HeLa cDNA library 
by standard techniques well known to those of ordinary 
skill in the art (HeLa S3 ATCC #CCL 2.2). 

The HeLa cDNA library vector , pCF128, was 
10 constructed as follows. The BamHI fragment for DB20 
(Becker et al. (1991) P.N.A.S. USA, ££:1968-1972) 
containing an ADH promoter and transcription terminator 
expression cassette was inserted into YEPlaclSl (Gietz 
and Sugino (1988) 21:527-534), which had ben 

15 digested with ffindlll and EcoRI and filled in with Klenow 
enzyme, creating intermediate plasmid pCF124. A PCR 
product encoding VP16 amino acids 411-500 (Triezenberg et 
al. (1988), Genes and Dev., 2:718) was inserted into the 
ffindlll site of pCF124 creating intermediate plasmid 
20 pCFl26. A fTindlll fragment containing the polylinker of 
DB20 (Becker, (1991) , supra) was inserted into the 
ffindlll site of pCF126 downstream of the VP16 coding 
region. This generated pCF128 that contains within the 
yeast vector, YEPlaclSl, the ADH promoter, the VP 16 
25 activation domain, a polylinker, and the ADH 

transcription terminator which encoded the Adh O terminal 
amino acids that comprise the scaffold. All cloning 
steps were done by standard methods (see, e.g., Ausubel 
et al., eds. (1987) Current Protocols in Molecular 
30 Bioloqy r John Wiley publishers) • 

HeLa cDNA was prepared using the Riboclone cDNA 
synthesis kit (Pr omega Corp. , Madison, WI) using random 
hexamer primers. Blunt end cDNA, ligated to a BstXl 
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adapter, was inserted into the flstXI sites of P CF128 to 
generate the test library. 

The library contained approximately 4xl0 6 
independent clones. This library was transformed into 
the yeast strain GGYl (Gill, G. et al. (1987) cbII 
51: 121-126) containing an integrated 2acZ reporter under 
GAL4 control and expressing a Gal4 (1-147) -Rev fusion. 

Prom approximately 2xl0 6 yeast transf ormants, 27 
primary positive clones were recovered. Further 
screening of the primary clones eliminated false positive 
clones. Candidate plasmids were retransf ormed into GGYl 
together with constructs expressing either the Gal4 DNA 
binding domain Gal4 (1-147) alone, the Gal4 fusion Gal4- 
Rev, or the fusion Gal4-Rev (M10) (Table l) . 
ls Table l 

Class of Clones 

^ I II III 
Construc ts 

Gal4 (1-147) + 

20 Gal4-Rev + + + . 

Gal4-RevM10 + + 

Number of isolates: 13 12 1 

Of the 27 candidates, 12 library plasmids 
activated transcription when transformed into yeast 

25 together with either one of the Gal 4 -Rev fusions, but not 
together with Gal4 (1-147) (Class 11 clones) . These 
cDNAs presumably encode proteins that bind to parts of 
the Rev protein that are not affected by the M10 mutation 
in the effector domain. Only one of the plasmids (R32) 

30 was able to discriminate between wild-type Rev and Rev- 
M10, causing reporter gene activation together with Gal4- 
Rev, but not Gal4-Rev(MlO) (Class III clone) . This 
indicates a specific binding of the protein product of 
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R32 to- the effector domain of Rev. The R32 plasmid 
contains a 377 bp cDNA fragment inserted into vector 
PCF128 (supra). The expressed fusion protein contains 
the VP16 activation domain, the R32-encoded peptide, and 
5 the Adh scaffold peptide and was determined to be a Rev 
interacting protein in subsequent binding studies. 

Example 3: Specific ity of the Rev interaction protein- 
Rev Binding 

To further assay the binding specificity of the 

10 isolated Rev interaction protein to Rev, the R32 plasmid 
was transformed into yeast together with Gal4 fusions to 
a number of different Rev mutants that have previously 
been described (Halim et al. (1991), supra). Binding of 
RIP to Rev wild-type and mutants was assayed by the 

is method of the invention. The production of HIV-l gtat 
and HIV-l gag p24 assayed by Halim et al. (Malim et al. 
(1991) , supra) is tabulated in Fig. 1 to illustrate the 
activity of the Rev mutants relative to wild type Rev. 
All of the mutants tested contain mutations in the Rev 

20 effector domain between amino acids 78 and 90, but not 
all of these mutations abolish Rev function (Fig. 1) . 

A high correlation exists between the function of 
these Rev mutants and their ability to bind to the RIP 
protein encoded in part by the original R32 sequences 

25 (Fig. 1) . Mutants M20 and M25 that retain some Rev 
activity according to Malim et al. (1991), supra, bind 
RIP; whereas M10, and the single point mutants M27 and 
M29 that have no Rev activity did not show any detectable 
binding to RIP. Mutant M22 does not show a correlation 

30 between RIP binding and Rev function and this may be due 
to alteration of the amino acid and its charge at 
position 83 when the amino acid is changed from Leu to 
Asp. 
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in *h *** SeqUenCe Was <=°»Pared with sequences 

in the databank (Genbank) there was no exact match 

lZ*Z SOmB ^ h ° n0l °^ ™ f —d on 

(Van. H et al. (1991) Gejjes a „ d Deveiop 5;744 

L Zll Cl0nS UCDNA 1±brary ' 936201 ' St »ta g ene, 

La Jolla, CA) corresponding to this region showed the R32 

izzi:\t: antisense strand ° f ^ 

A Northern blot probed with a strand-specific rna probe 

vLldT " ieSSageS enC ° ding *- R32 ° Pen " adi »* *«- 
yielded no signal above background signal; whereas the 

same blot hybridized with a randomly radiolabeled R32 

is probe (i.e. both sense and antisense strands were 

radiolabeled) detected one moderately abundant 

M^n riP ? ° f C ° rres P° ndi »* i» -i- to the human 

MCM2 homologue. These findings indicate that the R32 
sequence is not part of a cellular protein coding 
10 transcript, but rather represents a sequence selected 
from the 'random' sequence pool of transcripts in 
antisense orientation present in the cDNA library. 

To confirm this hypothesis, fusion proteins 
expressed by various constructs were tested for Rev 
s interaction (Fig. 2) using the two-hybrid assay. Becker 
et al. reported that the Adh terminator contains a 
cryptic promoter capable of transcribing upstream 
sequences (Becker, D. M. et al. (1991) PNAS fifi:i968- 
1972) . Fusion of the human MCM2 gene in frame with the 
> VP16 activation domain did not result in transcriptional 
activation in yeast when cotransformed with the Gal4-Rev 
construct (Fig. 2 , construct 3). Thus, the R32 fusion 
protein, and not the hMCM2 homologue, is responsible for 
Rev binding. 
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Surprisingly, introduction of a stop codon 
directly after the R32 sequence also abolished Rev 
binding (Pig. 2, construct #2). Closer inspection of the 
sequence revealed that in the original clone there was no 
s stop codon after the cloned cDNA (R32) , adding to the 
open reading frame 16 amino acids from poly linker 
sequences and 35 amino acids from the C-terminus of the 
yeast Adh protein present in the Adh terminator fragment 
used in the library test vector. This was confirmed in 

10 construct #6 (Fig. 2) where R32 sequences taken from the 
antisense strand of the human MCM2 open reading frame 
clone (without Adh sequences) again failed to interact 
with Rev. If a R32-Adh fusion was cloned upstream of the. 
stop codon in clone #2 (Fig. 2), a functional protein was 

15 again obtained (Fig. 2, construct #5). A frame shift 
between VP 16 and R32 sequences also led to an inactive 
construct (Fig» 2, construct #4). 

Thus, the functional Rev interaction protein 
consisted of R32 sequences (derived from hMCM2 antisense 

20 transcript) fused to 51 amino acids from poly linker and 
Adh sequences (see Fig. 4). The fact that vector 
sequences were required for the interaction between R32- 
encoded amino acids and Rev indicates that R32 was not 
the physiological Rev target but actually represents a 

25 peptide sequence selectable when fused to a scaffold 
peptide domain, the Adh C- terminal sequence. 

Example 5; Pelineatipn of the Minimal Prptgin seeping? 
Required t <?r R?v Binding 

To determine the minimal amino acid sequence 
30 required for Rev binding, N- and C-terminal ExoIII 
deletions were performed using construct #5 (Fig* 2) 
which encoded a functional Rev interaction protein 
(containing a R32-Adh scaffold fusion) . Plasmid pools of 
different time points from unidirectional ExoIII deletion 
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experiments were co trans formed into yeast with the Gal4- 
Rev target vector. Plasmids were rescued from positive 
yeast colonies. The shortest functional peptide 
recovered from an N-terminal deletion series had 203 of 
s the 377 bp of the original R32 sequence deleted. The 
construct was transformed into yeast strains expressing 
Gal4(l-147), Gal4-Rev, and Gal4-RevM10 to assess binding 
specificity. The N-terminal deletion peptide not only 
retained full discrimination between Rev and RevMlO but 
io judging from the levels of fi-Gal produced by these 

strains, the interaction between the N-terminal deleted 
Rev interaction peptide and Rev was stronger than the 
interaction detected with the original Rev interaction 
protein isolate. 

is to determine the number of C-terminal amino acids 

that could be deleted from the original Rev interaction 
protein isolate without loss of Rev binding activity, a 
similar strategy of ExoIII deletions was used. These 
manipulations were again performed with clone #5 (Fig. 
20 2) . Surprisingly, the active constructs from late 

deletion time points all showed the same structure: the 
polylinker and the Adh domain upstream of the stop codon 
had been deleted. Also, all of the recovered active 
clones no longer had the stop codon and were now in frame 
25 with the downstream Adh domain. Since, in these 

experiments, no active clones were recovered which lacked 
the downstream Adh domain fused in frame with upstream 
R32 sequences, conclusions regarding a minimal active 
protein can be drawn. Only one amino acid can be deleted 
30 from the C-terminus of the R32 sequence, and the 

polylinker sequences are dispensable, whereas all 35 
vector-encoded amino acids (SEQ ID NO: 3) originating from 
the yeast Adh protein are necessary for an interaction 
with Rev. The minimal amino acid sequence, RIP (£ev 
35 interacting arotein, SEQ ID NO: 2), is shown in Fig. 4. 
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The original R32 clone was 377 nucleotide base 
pairs in length and encoded 125 amino acids. The two- 
hybrid library test vector polylinker added 16 amino 
acids and the last 35 amino acids were contributed from 
5 the Adh terminator in the expression cassette. Deletion 
analysis showed that 67 of the unique 125 amino acids as 
well as the polylinker sequences could be deleted from 
the construct without losing Rev binding ability. The 
minimal protein sequence therefore consists of 58 amino 
10 acids encoded by the hMCM2 antisense fragment fused to 
the Adh scaffold of 35 amino acids (SEQ ID NO: 3) of the 
Adh C- terminus. 

Example $; RIP Expression Inhibits HT V Viral Protein 

Expression 

is Rev is essential for HIV replication in human 

cells and the Rev effector domain is required for its 
function (Malim, M. et al. (1989), supra). The RIP 
peptide , containing a portion of a cDNA isolated from the 
random pool of normally untranslated messages fused to 

20 the Adh-derived scaffold, was tested to determine if its 
Rev binding ability correlated with an ability to inhibit 
Rev function. 

For these tests, the HIV-1 infectious molecular 
clone, pNL4-3 (Adachi, A. et al. (1986) J. Virol. 59:284- 

25 291) , which directs the expression of infectious HIV-1 
virions following transfection into a variety of cell 
types, was used as the source of HIV in human cells. 
Preliminary experiments were performed in HeLa cells 
(HeLa S3 ATCC #Cd* 2.2). HeLa cells were grown in 

30 monolayers in DMEM media (GIBCO, Grand Island, NY) 

supplemented with 10% fetal bovine serum. Cells were 
cotransf ected with pNI#4-3 and either a CMV 
promoter-driven expression vector (pcDNAl, Invitrogen, 
San Diego, CA) or the identical expression vector 
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containing the rip C DNA (SEQ ID NO-n A nrOTB , , 

the »j„httt ^ «o.l) (pcRlP) cloned into 

the ffafldlli and j?coRl sites of pcDNAl 

s phos Ph at ;rta (Chen c . et * ^ : — es 

632-638) using 5 „g of I/TR-CAT (chloramphenicol 
acetyltransferase) reporter construct, s L of ^ 

Plasmxd DNA, and carrier DNA. Suspension culturL werl 
transfected with DEAE-dextran (Margolis et al. mJ 3 " 
id Virology. ^370-374, using 4 „ of pNL4 . 3 ^ 
of test plasmid DNA. to assay HIV-x replication 
allots of culture media were sailed for detection of 
HIV-i gag P 24 protein (Veronese, et al. (i9 85 ) A s 

«. **:5 199 -5202>. Assays were performed using an 
is antigen-capture ELISA kit (Coulter, Hialeah, FL) 

inhibition of HIV-i production increased with 
cotransfection of increasing amounts of rip expression 
vector relative to a constant amount of P NL4-r H iv-i 

,0 HIv e ™ a Lr 0ne (Fi9 ' 5) * EXpreSSi - ° f a co-transfected 
>0 HIV LTR-driven CAT reporter gene in these cells was 

unaffected by RIP expression, demonstrating that 

inhibition of virus production was not due to 

transcriptional effects, or non-specific toxicity 



25 



30 



CP Inhibits rr „ n ,_ T1nn in _ . _ 

as lymphocytes are a primary target of HIV-i 
infection In vivo, further experiments were performed in 
lymphocyte cell lines, m CEM cells, cotransfection of 
RIP-encoding DMA at a transfection ratio of 2:1 (4 M g 
PNL4-3 and 8 nq pcRIP) resulted in inhibition of HIV-i 
antigen production up to twenty-fold greater than that 
mediated by cotransfection of the dominant negative Rev 
mutant M10 (Fig. 6 A) . Previous studies demonstrating 
potent inhibition by Rev M10 in similar cotransfection 
studies have used transfection ratios of 10:1 (Kalim et 
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al. (1989), supra). Thus, RIP is a more potent inhibitor 
requiring a two-fold excess (2:1) of Rip to inhibit 
whereas a 10-fold excess (10:1) of M10 is required for 
inhibition of HIV-l antigen production. Cell viability 
5 and cell growth were unaffected by Rip expression as 
tested using well known techniques of cell counting and 
trypan blue assay. Cotransf ection of a secreted alkaline 
phosphatase reporter gene (pCMU/SEAP) (Tropix Inc., 
Bedford, MA; Cullen, B. et al. (1992) Meth. Enzyme, 

10 21^:362-368) showed that RIP did not impair the ability 
of these cells to express transf ected genes (Fig. 6B) . 
Decreased expression of alkaline phosphatase at later 
time points in cells cotransf ected with Rev M10 or pcDNAl 
may reflect ceil killing by HIV-l. Similar 

15 cotransf ection experiments performed in H9 lymphocytes 
also demonstrated the ability of RIP to inhibit virion 
production in a dose dependent manner (Fig. 7) . 

T cell lines, CEM and H9 were maintained in 
suspension culture in RPMI 1640 media (ICN Biochemicals, 

20 Cosa Mesa, CA) supplemented with 10% fetal bovine serum 
(GIBCO, Grand Island, NY) • 

Example 9; RIP Inhibition of HIV Is Not Due to 

Antisense-Mediated Inhibition of MCM2 Expression 

Since most of the RIP peptide has been shown here 

25 to be encoded by the antisense strand of a human MCH2 
homologue, an important control is to show that the 
observed inhibitory affect on HIV-l production was not 
due to antisense transcript-mediated inhibition of 
endogenous hMCM2 . The yeast MCM2 protein has been shown 

30 to be involved in plasmid maintenance (Yan, H. et al. 

(1991) Genes and Development, 5:944-957), and is thought 
to be involved in the initiation of DNA replication 
(Yan H. et al. (1991), supra). A frame shift mutation 
was generated directly after the ATG in the RIP 
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expressing vector. This placid therefore retains all of 
the antisense sequence, however it is not able to express 
a Rev binding peptide. Pig. 7 shows ^ such a Xpr — 
construct (pc ARIP) has no antiviral activity. 




10 



15 



20 



25 



30 



The rip of the invention as well as anti-Rip 
antibodies are useful in an assay for the presence of Rev 
in cells of a patient. Thus, the assay is useful as a 
diagnostic tool to determine whether cells (e.g. 
lymphocytes) of a patient contain the HIV virus. 

To determine the presence of actively replicating 
HIV in a cell population, lymphocytes of a human 
suspected of being infected with HIV-i are collected, 
suspended in an appropriate culture medium to induce' 
cellular and viral replication and induce expression of 
viral proteins such as Rev. The cells are then lysed and 
the supernatant containing Rev is contacted with RIP 
immobilized on the surface of the well of a standard cell 
culture dish as shown in Fig. 8 (steps A and B) . After 
specific binding of Rev to the immobilized RIP, the 
remaining, unbound proteins of the lysate are removed by 
washing (Fig. 8 , step C) . Next, an anti-RiP antibody is 
contacted with the proteins in the cell culture well 
(Fig. 8, step D). The anti-RiP antibody binds to the 
remaining free RIP peptides. When Rev protein is not 
present in the lysate, the signal is high. When the 
lysate contains Rev, the signal is reduced due to a 
decrease in the number of accessible RIP proteins in the 
well. 

The anti-RiP antibody can be directly labelled 
with a detectable tag such as radio label ling with 125 i or 
a covalently bound fluorescent moiety, e.g., fluorescein. 
Further, unlabeled anti-RiP antibodies can be used, and 
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the signal generated by the secondary binding of a 
labeled antibody to anti-RIP antibody. Further, it is a 
feature of the invention that, under the conditions of 
the assay, the anti-RIP antibody binds RIP with reduced 
5 affinity relative to the binding of Rev to RIP* Thus, 
the Rev protein remains bound to RIP during the assay for 
reliable qualitative or quantitative determination of the 
presence of Rev in cells. 

The assay is also useful in screening candidate 

10 agents that inhibit viral production by inhibiting the 
expression of essential viral proteins such as Rev. 
According to this embodiment of the invention, a cell 
culture containing actively replicating HIV is contacted 
with a candidate agent in an appropriate culture medium. 

15 The cells are assayed as above for Rev protein. 

Successful agents are those which reduce the amount of 
Rev protein by at least approximately 90% in the treated 
cells relative to untreated control cells. 

Standard protocols are followed to carry out an 

20 assay method of the invention. For example, after 
bringing the immobilized RIP into contact with a cell 
lysate, the surface is washed to remove unbound material. 
Further, standard blocking protocols are generally used 
prior to contacting the washed surface with labeled 

25 antibodies. An alternative to using cells infected with 
HIV is to provide acutely virus infected cells. For 
example, lymphocytes are obtained from uninfected persons 
and infected with a defined amount of HIV. The cells are 
then treated to remove input virus and cultured. 

30 Isolation of RIP of other selected random peptide- 

scaffold fusion proteins and production of antibodies to 
them are performed using standard techniques well known 
to those of ordinary skill in the art. The selected 
random peptide-scaf f old peptide of the invention can be 

35 produced by first transforming a suitable host cell with 
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th. nucleotide sequence encoding the random peptide- 

lZll? t illiz p r ein cloned lnto a 8uitawe 

5 n„, Th ,?" ° r ° rdinar y ^ th. field of molecular 

e^reT «»* any of a wide variety of 

«cpress l0 n system can be used to provide th. protein or 
Polypept.de. The precise host cell used is not critical 
to the invention. The polypeptide can b. produced in a 

(e.g., SaccharomycBs cerevisiae) . The method of 
transformation of the cells and the choice of expression 

iTi^t 1 depend on ^ host syste - -lected^ ::L: ds 

described herein provide sufficient guidance to 
is successfully carry out the production, purification and 
identxfxcatxon of rip or other selected random peptide- 
scaffold fusion protein. 

once the RIP or other selected random peptide- 
scaffold fusion protein is expressed, it is isolated, 
e.g., using imaunoaf f inity chromatography. i„ one 
example, an anti-Rip antibody can be attached to a column 
and used to isolate RIP. Lysis and t ra ctionation of Rip- 
oontaining host cells prior to affinity chromatography 
can be performed by standard methods. Once isolated, the 
25 recombinant protein can, if desired, be further purified 
e'g" Pi h ±9h Perf0rBanCe li « uid chromatography (see, 

Wg l eculnr P1o1o^ reds!T Work^nd^BiJdonl ^ElsIvilrT ^ 
(1980)). rip or other peptides can also be produced by 
30 chemical synthesis by standard solution or solid phase 
peptide synthesis techniques. 

Substantially pure RIP or other random peptide- 
scaffold fusion protein can be used to raise antibodies. 
Antibodies directed to the polypeptide of interest are 
35 produced as follows. Peptides corresponding to all or 
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part of the polypeptide of interest are produced using a 
peptide synthesizer by standard techniques, or are 
isolated and purified as described above. The peptides 
are coupled to KLH with m-maleimide benzoic acid N- 
5 hydroxysuccinimide ester. The KLH-peptide is mixed with 
Freund's adjuvant and injected into animals, e.g. guinea 
pigs or goats, to produce polyclonal antibodies. 

Monoclonal antibodies can be prepared using the 
polypeptide of interest described above and standard 

10 hybridoma technology (see, e.g., Kohler et al., Nature 
(1975) 25^:495; Kohler et al., Eur. J. Immunol. (1976) 
6:292; Kohler et al., Exit. J. Immunol. (1976) 6:511; 
Hammerling et al., in Monoclonal Antibodies and T Cell 
HYfrriflpmas, Elsevier, NY, (1981), which are incorporated 

is herein by reference) . Antibodies are purified by peptide 
antigen affinity chromatography. 

Once produced, antibodies to RIP, for example, are 
tested for specific RIP binding by Western blot or 
immunoprecipitation analysis by standard techniques. 

20 Uses 

The method of isolating a random peptide-scaf f old 
peptide fusion protein is useful to discover peptides 
that bind specifically to selected target proteins. As 
described above, the binding of the random peptide is 

25 enhanced by the 3-dimensional structure assumed in the 
context of the stably folded scaffold peptide* 

The fused scaffold structure is also useful to 
determine the 3 -dimensional structure of the random 
peptide such that small organic molecules can be designed 

30 to mimic some of its structural features. These small 
organic molecules are useful as potential therapeutic 
agents where binding to the target protein affects its 
function, e.g., inhibition of Rev protein. 
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15 
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25 



30 



35 



The isolated random peptide-scaf fold fusion 
protein RIP is useful as an inhibitor of HIV production 
in cells, such as human lymphocyte cells, rip and 
antibodies to RIP are useful in an assay for the presence 
of Rev in a lysate of cells suspected of being infected 
with HIV. 

RIP can be used to treat or prevent HIV infection 
by introducing a DNA sequence (SEQ ID NO:l) encoding rip 
(SEQ ID NO:2), or Rev interacting analogues of the 
original R32-scaffold fusion, into the genome of a human 
cell infected with, or likely to be infected with, HIV. 
RIP is expressed in the cell at sufficient concentration 
within the cell to inhibit Rev function and, thereby 
inhibit HIV replication. 

cells which are transfected with RIP-encoding DNA 
sequence include cells that can be infected by HIV such 
as CD4+ expressing cells. In one method, bone marrow 
stem cells from an infected patient are removed from the 
patient and then infected with a an appropriate viral 
vector containing RIP-encoding DNA sequence by standard 
gene therapy techniques well known in the art (see e.g., 
Horsy, M.A. et al. (1993) JAMA 22£: 2338-2345) . The 
patient's remaining bone marrow is preferably destroyed 
by irradiation or chemical ablation to eliminate any HIV- 
infected stem cells that do not express Rip. The 
transfected stem cells are reintroduced into the host 
from which they were isolated (autologous 
transplantation) by injection into a vein of the host 
(SSfi, Biron, P. et al. (1993) in Autologous. Bone Marrow 
Transplantation; Proceedings of the First Tnternational 
Symposium , Dicke, K.A. et al. (eds.), p. 203 for relevant 
techniques) . 

The transfected stem cells are allowed to 
repopulate the bone marrow of the patient and provide 
CD4+ cells expressing RIP which inhibits HIV replication 
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in these cells when infected with the virus. This 
results in simultaneously treating and protecting the 
host from disease resulting from HIV infection and 
replication. Allogenic transplantation of stem cells 
s expressing RIP from an uninfected host is another 
embodiment of the invention and provides the same 
protection to transfected cells and their progeny from 
subsequent HIV replication. 

Another use for a random peptide isolated by the 
io method of the invention involves analyzing the structure 
of the random peptide in the context of the scaffold 
peptide for designing small molecule therapeutic agents. 
First, DNA encoding a random peptide is inserted into the 
coding sequence of the scaffold peptide, e.g., the Bl 
15 domain of G protein. In this context, the random peptide 
is displayed on the surface of a stable protein and 
therefore is less likely to be degraded. Also the 
structural constraints of the scaffold may maintain the 
peptide in an active conformation. The structure of the 
20 random peptide necessary to interact with the target 
protein is deduced by comparing the 3 -dimensional 
structure of the scaffold peptide domain with and without 
the inserted random peptide sequence, e.g., by using NMR 
analysis (Heinz et al. (1994) J. Hoi. Biol. 869-886) . 

25 Such an analysis would not be possible without the 

structural stabilizing effects of the scaffold peptide 
domain in combination with the previously solved NMR 
analysis of the scaffold structure. 

Peptides that are not constrained by a scaffold 
30 peptide domain may not have a defined structure in 
solution. They assume a specific structure only when 
binding to their target protein, i.e., an induced fit. 
Structural analysis of the random peptide inserted in a 
scaffold peptide domain obviates the need for analysis of 
35 the target protein-random peptide complex. 
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Three-dimensional information of an effective 
bxnding structure provides useful information for 
designing small organic molecules that can also bind the 
target protein. For example, a small organic molecule 
which mimics an aspect of the 3-dimensional structure of 
RIP can be designed and tested for its ability to inhibit 
HIV production in cells. 



WO 96/23899 



PCT/US95/09589 



- 31 - 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 
(i) APPLICANT: 

5 (ii) TITLE OP INVENTION: 

(ill) NUMBER OF SEQUENCES: 



University of Massachusetts Medical 
Center 

METHODS OF SELECTING A RANDOM 
PEPTIDE THAT BINDS TO A TARGET 
PROTEIN 

7 



10 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson 

(B) STREET: 225 Franklin Street, Suite 3100 

(C) CITY: Boston 

(D) STATE: MA 

15 <E) COUNTRY: USA 

(F) ZIP: 02110-2804 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

20 (C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBERS 

(B) FILING DATE: 



25 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/381,800 
<B) FILING DATE: 01-FEB-1995 

(viil) ATTORNEY / AGENT INFORMATION: 

(A) NAMEs Clark, Paul T. 

30 (B) REGISTRATION NUMBER: 30,162 

(C) REFERENCE / DOCKET NUMBER: 04020/062WO1 

(ix) TELECOMMUNICATION INFORMATION s 
<A) TELEPHONES (617)542*5070 

(B) TELEFAX: (617)542-8906 
35 (C) .TELEX: 200154 



(2) INFORMATION FOR SEQ ID NOsls 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : single 

(D) TOPOLOGY s linear 

(11) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOsls 
ACGTACCTTG TGCTTGCCAC CTGGGTTTTT GGGCTCCCCT CCGAACAGGG CCAGAGCCAG 60 
ATGTCTTCAT GACCAGAGAT GGAAGGAGCA ATGCTGGCAA AGATCTTCTC 120 
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TCOGATCTCC TGATCCTTGG AGAGGCTAGT CATCATCTTC ACATCTTCAT CGGTCCTGGC 
GAGGACACTG CCCCCCGCTC GAAGCTTTCG ACTTCTTCCC CATTGGTCAA GTCTCCAATC 
AAGGTTGTCG GCTTCTCTAC CTTGCCAGAA ATTTACGAAA ACATCCAAAA GGGTCAAATC 
CTTACATACG TTCTTGACAC TTCTAAATAA 

P2> INFORMATION FOR SEQ ID NO: 2* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 109 amino acide 

(B) TYPE: amino acid 

( c > STRANDEDNESS : not relevant 
10 (O) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO* 2: 

Thr Tyr Le* Val Leu Ala Thr Trp Val Phe Gly Leu Pro Ser Glu Gin 

10 15 

Cly Gin Ser Gin Al. Ser Leu Asp v.l Ph . Met Thr Arg Aep Gly Arg 

25 30 
Ser Aan Ala Gly Ly. Asp Leu Leu S.r Asp Leu Leu lie Leu oly Glu 

Al. Ser Asp Hi. Leu Hi- lie Phe He Gly Pro Gly clu His Thr Gly 



180 
240 

300 
330 



IS 



20 



60 



25 



Oly Arg Ser Lys Leu Trp Thr Ser Ser Pro Leu Val Lys Ser Pro He 

75 80 
l.y. V.1 v.l Cly Leu Ser Thr Leu Pro Glu He Tyr Glu Lys Met Glu 

90 95 
I-YB Gly Gin lie Val Arg Tyr Val Val Aop Thr Ser Lye 



(2) INFORMATION FOR SEQ ID NO:3: 



30 



35 



(i) SEQUENCE CHARACTERISTICS * 

(A) LENGTH* 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS * not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION* SEQ ID NO* 3: 

Leu Val Ly. Ser Pro He Lye Val Val Gly Leu Ser Thr Leu Pro Clu 

S 10 15 

!!• Tyr Glu Lye Met Glu Ly. Gly Gin He Val Arg Tyr Val Val Aep 
20 25 30 
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Thr Ser Lys 

35 



(2) INFORMATION FOR SEQ ID NOi4; 

(I) SEQUENCE CHARACTERISTICS t 
5 (A) LENGTH : 175 base pairs 

(B) TYPEs nucleic acid 

(C) STRAND ED NESS t single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 1 

ACGTACCTTG TGCTTGCCAC CTGGGTTTTT GGGCTCCCCT CCGAACAGGG CCAGAGCCAG 60 
GCCTCTCTTG ATGTCTTCAT GACCAGAGAT GGAAGGAGCA ATGCTGGCAA AGATCTTCTC 120 
TCCGATCTGC TGATCCTTGG AGAGGCTAGT GATCATCTTC ACATCTTCAT CGGTC 175 

(2) INFORMATION FOR SEQ ID NO: 5: 

15 (i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 58 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Thr Tyr Leu Val Leu Ala Thr Trp Val Phe Gly Leu Pro Ser Glu Gin 
1 5 10 15 

Gly Gin Ser Gin Ala Ser Leu Asp Val Phe Met Thr Arg Asp Gly Arg 
25 20 25 30 

Ser Asn Ala Gly Lys Asp Leu Leu Ser Asp Leu Leu lie Leu Gly Glu 
35 40 45 

Ala Ser Asp His Leu His lie Phe lie Gly 

50 55 



3(B) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(Xt) SEQUENCE DESCRIPTION x SEQ ID NOi6: 
TTGCTCAACT CTCCAATCAA CCTTGTCCCC TTCTCTACCT TCCCAGAAAT TTACGAAAAG 60 
ATGGAAAACG GTCAAATCGT TACATACGTT CTTGACACTT CTAAATAA 108 
(2) INFORMATION FOR SEQ ID NO: 7 : 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH i 23 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Pro Val Pro Leu Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cy> 

Aon Glu Aap Cya Gly Thr Ser 

15 20 



15 



Other embodiments are within the following claims. 
What is claimed is: 



WO 96/23899 



PCT/US95/095S9 



- 35 - 

Claims 

1. A method of isolating a random peptide that 
binds a predetermined target protein, said method 
comprising: 

5 a) providing a library of nucleotide sequences 

encoding random peptides, each of said sequences encoding 
a candidate random peptide; 

b) cloning a sequence from said library into a 
first yeast two-hybrid expression vector to make a 

10 library test vector expressing a random peptide-scaf fold 
fusion protein comprising 

i) an N-terminal transcriptional activation 

domain, 

ii) said candidate random peptide, and 
15 iii) a scaffold peptide domain; 

c) cloning a nucleotide sequence encoding a target 
peptide domain into a second expression vector to make a 
target vector expressing a target fusion protein 
comprising 

20 i) an N-terminal DNA binding domain, and 

ii) a C-terminal target binding domain; 

d) introducing said library test vector and said 
target vector into a yeast cell wherein said cell 
contains a marker gene expressed when the test fusion 

25 protein binds to the target fusion protein; and 

e) isolating said random peptide-scaf fold fusion 
protein from a cell in which said marker gene is 
expressed. 

2. A method of claim 1, wherein said random 

30 peptide sequence is inserted within a scaffold peptide 
domain. 

3. A method of claim 1, wherein said scaffold 
peptide domain is a peptide comprising SEQ ID NO: 3. 
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4. A method of claim l, wherein said scaffold 
peptide domain is a peptide comprising the Bl domain of g 
protein. 

5. A method of claim l, wherein said scaffold 
peptide domain is a peptide comprising a zinc finger 
domain, the B domain of protein A of s. aureus, the tenth 
type III module of fibronectin, or cyclophilin. 

6. Substantially pure DNA of SEQ ID NO:i, or 
degenerate variants thereof, encoding an HIV-l Rev 

io binding peptide. 

7. A substantially pure peptide comprising an 
amino acid sequence substantially identical to SEQ lj> 
NO: 2, which peptide binds to HIV-l Rev protein. 

8. Substantially pure DNA of SEQ ID NO: 4, or 
is degenerate variants thereof. 

9. a substantially pure peptide comprising an 
amino acid sequence substantially identical to SEQ ID 
NO: 5. 

10. An antibody that binds specifically to the 
20 peptide of claim 7. 

11. A vector containing the DNA of claim 6 
operably linked to transcriptional and translational 
regulatory sequences, wherein said vector expresses said 
DNA as an HIV-l Rev binding peptide in a mammalian cell. 
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- 12. A method of inhibiting HIV-1 replication in a 
mammalian cell, said method comprising: 

a) introducing a vector of claim 11 into a 
mammalian cell infected with HIV-1; and 
s b) expressing said HIV-1 Rev binding peptide in 

said cell at a concentration within said cell that 
inhibits HIV-1 replication such that virion production is 
reduced within said cell. 

13. A method of treating a human for HIV-1 
10 infection, said method comprising: 

a) providing a vector of claim 11 in a 
pharmaceutical^ acceptable carrier; 

b) introducing said vector into a human cell such 
that said DNA sequence is incorporated into the genome of 

15 said cell and said HIV-1 Rev binding peptide is expressed 
in sufficient concentration in said cell to inhibit 
virion production; and 

c) administering said vector-containing cell into 
said human such that said cell and its progeny are viable 

20 in said human, wherein said cell and its progeny inhibit 
virion production. 

14. A method of detecting human immunodeficiency 
virus (HIV) in a sample of human CD4* cells, said method 
comprising: 

25 a) providing a sample of human CD4+ cells; 

b) inducing replication of any HIV in said cells; 

c) lysing said cells and collecting a cell 
supernatant containing cellular and viral proteins; 

d) contacting said cell supernatant with a peptide 
30 of claim 7, and allowing formation of peptide-Rev protein 

comp 1 exes ; and 

e) detecting said peptide-Rev protein complexes as 
an indication of HIV in the cell sample. 
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, K ^ , A "• thod ° f sc "«ning for candidate agents 

that inhibit production of HIV by inhibiting production 
of Rev protein in a human cell, said method comprising: 

a) providing a population of human cells 
containing actively replicating HIV i„ culture medium; 

b) contacting said population of cells with a ' 
candidate agent to inhibit Rev production; 

c) lysing said cells and collecting a cell 
supernatant containing cellular and viral proteins; 

d) contacting said cell supernatant with the 
peptide of claim 7 and allowing formation of peptide-Rev 
protein complexes; 

e) detecting said peptide-Rev protein complexes as 
an indication of Rev protein production in the cell 
sample; 

f) repeating said steps a), c) , d) , and e) to 
establish a control sample; and 

g) comparing said Rev protein production in the 
presence of said candidate agent with Rev protein 
production in said control sample to screen candidate 
agents that inhibit production of HIV. 
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